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In order to discern aggregation in solution, we present a quantum mechanical analogue of the 
photon statistics from fluorescent molecules diffusing through a focused beam. A novel generating 
functional is developed to fully describe the experimental physical system, as well as the statistics. 
Histograms of the measured time delay between photon counts are fit by an analytical solution 
describing the static as well as diffusing regimes. To determine empirical fitting parameters, fiuo- 
rescence correlation spectroscopy is used in parallel to the photon counting. For expediant analysis, 
we find that the distribution's deviation from a single Poisson shows a difference between two single 
fluor monomers or a double fluor aggregate of the same total intensities. Initial studies were pre- 
formed on fixed-state aggregates limited to dimerization. However preliminary results on reactive 
species suggest that the method can be used to characterize any aggregating system. 

PACS numbers: 81.15.K 



I. INTRODUCTION 

Aggregation and cooperative binding are fundamental to biological function and regulation, but difEcult to observe 
at the few molecule level. Recent advances in few molecule solution spectroscopy have been achieved by combining 
the comparitively large signals of fluorescence with recent technological advances in photon counting (e.g. low noise 
detectors). One powerful technique, fluorescence correlation spectroscopy (FCS), has enabled researchers to observe 
many small ensemble processes such as diffusion [2], molecular conformational dynamics [3], and reaction kinetics 
[4]. In FCS, fluctuations in fluorescence intensity are temporally correlated to reveal the timescale of the underlying 
fluctuation source. In the simplest case, this could be the diffusion timescale of a fluorescent molecule through a 
sampling volume. However, it is difficult to discern aggregation based on the diffusion time alone since the increase 
in diffusion time between a monomer or a dimer is weakly dependent on the increase in effective radius (oc< >^/^ 
for hard spheres) [5]. 



An alternative approach to FCS is the statistical analysis of the time series of emitted photons (number counting 
Fig 1), since the number distribution of photons in the detection volume at any given moment is different for species of 
different quantum yield or fluorophore number. This analysis, called photon number counting histogram analysis [7,8] 
was originally developed to detect the "brightness" per particle, but relied on the static (non-diffusing) limit. However 
in experiment, several factors potentially overshadow the specific brightness signature of a fluor, such as triplet states, 
bleaching, or quenching. We propose a new method of analysis, the time delay histogram, to discern small differences 
in a specie's fluorescence complemented by differences in diffusive behavior. The time delay histogram is constructed 
of the time between successive photon counts (Fig 1). Unlike counting the number of events per time bin, this type of 
counting allows us to extract additional information about the diffusion. Conceptually, as more fluorinated monomers 
aggregate, the fluorophores sample the detection volume in bunches. This increases the chance of a large time interval 
between two successive emission photons and the large At tail of the time delay histogram has an increased frequency. 

To extract the information on the structural change, we have developed a generating functional to unify different 
statistical aspects of the photon time series. This functional is modeled as a transition matrix element of a fictitious 
quantum mechanical system with time variable continuated to the imaginary axis. Many well developed techniques 
in quantum mechanics are borrowed to derive analytical expressions for other experimental observables such as the 
auto correlation function. Our approach is complementary to the description model proposed by Novikov and Boens 
[9] for the photon counting histogram. An extension of our functional can also be applied to modern multi-channel 
techniques [10,11] as well as the incorporation of secondary processes such as triplet states, bleaching, quenching, and 
chemical reaction kinetics. 

Recently Kask, et.al also proposed a method to extract the diffusion signature of a particle through fluorescence 
intensity multiple distribution analysis (FIMDA) [11]. In FIMDA, many histograms are constructed from the same 
time trace as the bin size is varied. In contrast, the time delay histogram we propose, contains the diffusion signature 
of the aggregate in a single histogram. At small time delays (less than the diffusion time through the collection 
volume) we are sensitive to the rate of photons emitted per object, and at greater delays, diffusion dominates the 
statistics. Finally, we offer a complementary technique to FIMDA using the Mandel Q parameter [1] to expediantly 
extract diffusion information from number counting distributions. The power of this technique could be increased 
by greater photon collection capabilities, leading to better statistics and ultimately better discrimination, including 
higher moments, in the large delay limit. 

To demonstrate the sensitivity of our analysis, we choose a case where the difference in diffusion times are negligible. 



A particularly difficult case is the discrimination of two monomers of identical fluorescence from a single dimer with the 
sum of their fluorescence. Although both have the same average rate of emitted photons, they sample the excitation 
beam profile differently as they diffuse (inhomogeneous intensity profile of a focused beam). For the experiment, a 
sequence of single stranded DNA is specifically tagged with either one or two fiuorophores per strand. The single dyed 
strands will be considered "monomers" and the doubled tagged as "dimers" . For all cases, we find good discrimination 
between samples of a given concentration of dimers versus that of twice the concentration of monomers, where both 
samples have the same average fluorescence. 



II. OVERVIEW OF APPROACH 

The time series of photon events may be described by the instantaneous fluorescence intensity, 

N 



I{t) = J2Se{t-tn) (2.1) 



= ^j\tl{t). (2.2) 



where ti,t2, ■■■,tn, --tN corresponds to the tick marks of Fig.l with N total number of photons counted and e is the 
detector resolution. 5f{t — tn) = \ for \t — tn\ < e and 6^{t — t„) = otherwise. In the theoretical analysis of the 
subsequent sections, we shall take the limit e — > so the instantaneous intensity becomes a random spike function. 
In the current literature statistical analysis tends to be limited to the average intensity, 

and various types of correlation functions, the most familiar being the auto-correlation function 

^^^^ ^Mi^jy + " ^1 " ^'^^^ ^^-^^ 

for T >> T, which decays with a characteristic time scale td, the diffusion time through the collection volume. The 
subtraction of 5e permits A{t) to vanish for all t for a Poissonian histogram. Dividing the integration time into Nb bins 
of equal time interval Th, i.e., T = N^T},, the number of photon counts falling within the time bins are mi, m2, mjVb, 
and their moments 

^i = j;fJ2< (2-4) 

carry the structural information of the underlying fluorescence molecules. The quantity for comparison, called the 
Mandel's Q parameter is deflned as 



where the first moment is given by Mi = iTb, etc. Although higher moments yield greater difFerientiation, we limit 
our analysis to the second moment due to the experimental collection capabilities (< 2^^ photons) of our system. The 
definition (2.3) and the relation m„ = Jf^\)^^ dtl{t) directly relates the Q parameter to the autocorrelation function 

S= — f " dsin - s)A{s). (2.6) 
n Jo 

Similar relations exist between higher order correlation functions and higher order binning moments. 

Finally we describe the time delay histogram of the distribution of time intervals between two successive photon 
events, i.e. Ati, At2,---- For sufficiently large numbers of photon counts, a distribution function of At, p{At) can be 
extracted and is analogous to the photon waiting time distribution of quantum optics. [6] The theory of this function 
will be developed in the sections IV and V and will be compared with experimental results in sections V and VI. 

III. A MATHEMATICAL THEORY OF TIME DELAY HISTOGRAM 

While there exist many articles in the literature on the mathematical properties of the photon event histogram 
[7,8], here we would like to provide a unified approach, which ties the experimental observables such as fluorescence 
intensity, auto-correlation, the binning moments, and time delay histogram to a probability generating functional. 

A. General formulation 

To begin, we divide the integration time T into Af bins, each of interval e, i.e., T = Me. Each fluorescence diffusion 

process produces a histogram of photon counting, {mi, totv/}, with mi the number of photon events within the l-ih. 
time bin and the corresponding fluorescence intensity given by 

k = ^ (4.1) 

Let Pmi...mM stand for the probability of this particular time delay histogram. The generating function of this set of 
probabilities is defined as 

g{z^,...,z^f)= Pm,...mj,zr...z^^, (4.2) 

mi,...,mx 

which is properly normalized, i.e.^(l, 1) = 1. In the limit e — > (or, equivalently A/' — > oo with a fixed T), the 
sequences {zi, zj^} and {/i, I_\f} become two functions of t, z{t) and /(i), such that z{le) — zi and I{le) = Ii. 
In particular, as e — > 0, most m's vanish, few of them are equal to one, and the probability of m; > 2 becomes 



negligible. The function I{t) approaches the random spike function introduced in the last section. In the same limit, 
the generating function (4.2) becomes a generating functional 

g[z{t)] = lim Gizu-.z^). (4.3) 

A/ — >(x> 

An important set of observables are various correlation functions given by the functional derivatives of Q with respect 
to z{t) at z{t) — 1, i.e. 

C„(*....,U = 4^...4;jlneWi)]U,)... (4.4) 
The function Ci (t) is nothing but the ensemble average of the instantaneous fluorescence intensity at the moment t, 

<I{t)>=C^{t), (4.5) 
the coefficient C2{ti,t2) is related to the auto-correlation function between ti and t2 via 

and the coeflicients C„(ti, f„)'s represent higher order correlations. For the observation times, ti,...,t_M- sufBciently 
away from the beginning of the integration time so that transient process maybe ignored, these functions depend only 
on time differences. In particular, Ci(t) becomes a constant and C(ti, depends only on the t = t2 — ti. In this way, 
we can reconcile the experimentally defined fluorescence intensity at time t and the auto-correlations of the previous 
section. 

The distribution function of the time interval At between two successive photon events, p(At), referred to as 
At-distribution, can also be extracted the probability generating functional 



const. 1 ^ S 
~T^Jo ^Sz{t) 6z{T + Aty 



p(AO = I dTj^^^:j-^^g[zit)] Ut)=at\r-At), (4.7) 



where the characteristic function ({t\T,At) = 1 for f e (r, r + At) and ({t\T,At) = otherwise, and the constant is 
determined by the condition 

/ dAtp{At) = 1. (4.8) 
Jo 

We refer the reader to the Appendix A for its derivation. While eqs.(4.7) and (4.8) are valid for a finite T, the limit 
T — > oo will be assumed for their applications. For a Poissonian histogram, (4.7) implies p{At) = Ae~^^*, as expected. 

What is relevant to the actual observation is the detected photons rather than the total number of emitted ones. 
Let Pmi,...,mM denoted the probability of a particular histogram {mi, mj<^} of detected photons. The corresponding 



generating function reads 

Q.n.{z,,...,Zu)= P^^m,...mj,z"'\..z"^^, (4.9) 
mi,...,m^r 

As is shown in the appendix A, under the assumption that aU photon counts are statisticaUy independent, we find a 
simple relation 

ges.{zi,...,z^r) = g{l - J] + rizi,..., 1-7] + T]Zx) (4.10) 
with T] the efficiency of the detector. In the limit — > oo, it becomes 

QeS.[z{t)]=S[l-V + Vz{t)]- (4.11) 

B. A Quantum Mechanical Analog 

The formulation we have established thus far is completely general, and independent of details of the fluorescence 
diffusion process. We shall now include the details of the physical process and model the generating functional ^[^;(t)]. 
Consider N molecules, each having a specific brightness, and diffusing in a solution of total volume O. Both Q — > oo 
and N ^ 00, at a. fixed concentration c = An axially symmetric intensity profile is created by focusing the 
laser beam within the sample solution. While fluorescence occurs everywhere along the beam volume, the pinhole 
effectively eliminates the collection of photons emitted far away from the focal point [12] and a small detection 
volume is defined, which contains few molecules in average at all times. In the absence of chemical reactions (no self 
hybridization), sufficiently weak laser intensity and sufficiently low concentrations, we may assume that i) N molecules 
do not interact with each other; ii) photon emissions are statistically independent in the absence of diffusion; and 
iii) the photon emission frequency per fluorophore within the detection volume is Xou{f). u{r) will be referred to as 
the fluorescence profile function and is normalized by the condition w(0) = UMax.(r) = 1- Under these assumptions, 
the probability generating functional takes the same mathematical expression of the transition matrix element of a 
quantum mechanical system of N noninteracting bosons in an external potential field and imaginary time, i.e. 

QW)] = < |re-r'^*«(*)| >, (4.12) 

where T enforces the time ordering and f2 is the volume of the solution and will be sent to infinity for all practical 

purposes. The Hamiltonian operator of the analog quantum mechanical system reads 

N 

H{t) = Y,Krj,t) (4.13) 



and 

h{f, t) = -DV^ + [1 - z{t)]Xu{f), (4.14) 

where D is the diffusion constant and A = fcAo with k the effective number of fluorophores per molecule. The wave 
function of the state | > of the analog quantum mechanical system is of zero momentum and is normalized in the 

coordinate representation to 

<fi,...,rA/-| >= 1. (4.15) 
The full derivation of (4.12)-(4.14) are presented in the appendix B. We notice that: 

1) . The Hamiltonian operator (4.14) describes the motion of a particle of mass (2D)^^ moving in an external 
potential [1 — z{t)]Xu{r}. Two kinds of expansions can be developed for the statistical analysis. The first is a per- 
turbative expansion according to the powers of 1 — z{t), which generates the correlation functions to all orders. The 
second is the expansion according to the powers of the diffusion constant, D, which is particularly useful for FCS with 
biological molecules. The leading order of the second expansion corresponds to the frozen limit in the literature [7,8] 
and we are able to add the higher order corrections systematically following this quantum mechanical analog. 

2) . It follows from (4.11) and (4.14) that the generating functional responsible for the observed time delay his- 
togram, Ge«.[z{t)], assumes the identical mathematical form as ^[^(t)], provided A is replaced by X^g, = rjX. In what 
follows, we shall refer exclusively to 0eff.[-2(i)] with the subscript "eff." suppressed. 

3) . The generating functional (4.12) can be factorized for each aggregate, i.e. 

g[z{t)]=g''[z{t)] (4.16) 

with 

g[z{t)] = ^<\Te-Io'''^'''^\> (4.17) 

and < r\ >= 1. Alternatively, g[z{t)] can be calculated by integrating the wave function ip{r^t) that solves the 
Schroedinger equation with an imaginary time, 

^Mr,t) = -h{r,t)^{r,t) (4.18) 



subject to the initial condition, tp{r, 0) = 1, i.e. 



9[z{t)\ = ^j d^mr,T). (4.19) 

The differential equation (4.18) and its initial condition below can be converted to an integral equation by treating 
the potential term of /i(r, t) as a source of the diffusion, 

V(f,i) = 1 + -A^ /' dt')-^ f d^f'e-^^u{f^)i,{f^,t'). (4.20) 
(47r£))5 Jo {t -t')^ J 

It follows from (4.19) that 

g[z{t)] = 1 + ^ j\t'[l - z{t')] J d^fu{fMf,t'). (4.21) 

As the main contribution to the integral comes from the detection volume specified by u{f), we find 

g[z{t)] - 1 = 0{l/n) (4.22) 

in the limit Q — > oo with a fixed detection volume. Taking this limit at a fixed concentration, c, and using the 
standard limit, limAr^oo(l + jf )^ = e^, we obtain that 

g[z{t)] = e-^[^Wl (4.23) 

with 

J^[z{t)] = c < 1 1 - Te" r I > . (4.24) 



C. Generalization to Multi-species and Multi-channels 

We generalize the present formulation to include several species of fluorescent molecules with multiple channels of 
detection. Assuming there are M species each labeled by an index I and K detecting channels each labeled by a, the 
Hamiltonian of the analog quantum mechnical system, (4.13), becomes 

M Ni 

H{t)=J2Y^hi{rj,t) (4.25) 

1=1 3 = 1 



with 

K 

hi{f,t) = -DiV^ + - Zc{t)]Xic,{f) (4.26) 

a=l 

where Xia{r) the profile function specific to the Zth species and ath channel, and it becomes Xu{f) for a single species 
and a single channel. The generating functional (4.12) factorizes into a product of a single species, where each now 
depends on several arbitrary functions Za{t). The power series expansion according to 1 — Za{t)'s yields all the 
corresponding correlation functions. Unlike number counting in the frozen limit, it can be factorized into individual 
detecting channels for nonzero diffusion constants. 

IV. ANALYTICAL EXPRESSIONS FOR DATA ANALYSIS 

In this section, we shall display the analytical expressions for fluorescence intensity, the auto-correlation function, 
Mandel's Q parameter and the At distribution, as derived from the general formulation of the previous section. The 
technical details of the derivation are deferred to the Appendix C. 

A. Fluorescence intensity, correlation functions and the binning statistics 

In accordance with eqns.(4.4) and (4.5), the ensemble average of the fluorescence intensity reads 

/ = cAy" cffuif) = nX, (5.1) 

where n = cv with v = J d?ru{r). The integration v can be viewed as the effective collection volume defined by the 

focused beam and pinhole and n, the average number of molecules within the volume. The experiments reported in 
this article are characterized by n 1. 

Applying the definition (4.4) and the quantum mechanical analog (4.12)-(4.14) of ^[^(t)], the second order correlation 

C2{ti,t2) takes the form 

with up = J d^fe~^^"^u{r) the Fourier transformation of the profile function u{r). Substituting (5.1) and (5.2) into 
(4.6) for the auto-correlation function, we find: 



1). For an arbitrary u{r}, the auto-correlation function at zero time lag takes a simple form 

^(0) = - 



(5.3) 



revealing the number or concentration of objects within the volume and Z the geometrical factor for that volume 
defined by 



J dP"ru{r) 

For nonzero time lag, we shall parametrize the auto-correlation function as 



(5.4) 



with ^(0) = 1. 



A{t) = -A{t) 



(5.5) 



2). For a 3D Gaussian fluorescence profile, 



u{f) = e 



xW._ 



we obtain that Z = and [1] 



A{t) = 



with Tj_ = io±/D and ry = Wy/D. For a Gaussian-Lorentzian profile, 

,2 ^2j.„2 



u{r} = 



we find that Z — j and 



A{t) 



with 



Erfc(0)= / 

J z 



(1 + ^)V r 



2 e 
dxe~^ = — 
2z 



Til -!- 



(5.6) 



(5.7) 



(5., 



(5.9) 



(5.10) 



For r << T|| and t± « ry, both (5.7) and (5.9) can be approximated by 



Extending the same analysis to the third order coefScient of the expansion of In^ according to the power of z{t) — 1, 
we find the third order correlation function for t± << t]|, i.e. 

c.(.,.....)^ „^^„f;^,_. . (5.12, 

where the constant Z' is another geometrical factor, like Z for the autocorrelation, t = ta — h, t' = tb — tc with ta, 
tf, and tc a permutation of ti, t2 and ^3 such that ta > tf, > tc- 

Using Eqn(4.6), we obtain the expression of the Q parameter which agrees with that of FIMDA [13]. For r << ry 
it can be approximated by 

S = 2ZXr^ f ^^±^ In ^^±^ - l) . (5.13) 

with Tb the size of the binning window and the dependence on different models of longitudinal profile absorbed in the 
constant Z. The distribution of photon counting numbers within a time bin can be extracted from the generating 
function G{z), obtained from the generating functional Q[z{t)] by restricting the form of z{t) such that it equals to a 
constant z withing the time bin and vanishes elsewhere. We then have 

G{z) = e-^-^f^li-^) (5.14) 

with 

i^(T|C) = - < |l-e-^''(^)| > (5.15) 
n 

and 

h{C) = -DV^ + CXu{r). (5.16) 

While an analytical expression for F{t\Q does not exist in general, the expansion according to diffusion constant can 
be obtained easily, 

F{r\0 = (XrfiCXr) - X^r' f {(Xr) + O(^) (5.17) 

with 

4 



f\^\jL-i, (5.18) 
SV'T Jo t. 



where the leading term corresponds to the frozen limit in [13] and the second term improves the approximation further. 



B. The time-delay histogram 



Carrying out the functional derivatives in the formulae (4.7) with the the aid of (4.16) and (4.17), the distribution 
function p(At) becomes 



pm 



1 <f 

An dAi2 



g-nF(At|l) ^ ^ 



+ n(- 



dF 



-nF{At\l) 



(5.19) 



Using expansion (5.17), we derive an approximate expression of p(At) which is valid for Arj_ >> 1 and Af << tj_, 

Inp(At) = Inp(O) - nxf{x)+\n{[f{x)+xf{x)f - -[2f{x)+xf"{x)]} 

n 

1,1 3.,. ^ , 2xnx) + 2x^f"{x) + \x^r"{x)-2nx^[f{x)+xr{x)][nx)-\xr{x)] ^ 
^\T^ 3«^/W+ _2f^a:)-xf"{x)+n[f{x)+xf'{x)f ^ ^^"'"^ 

with X = AAt. Alternatively, a Taylor expansion of F in AAi yields an expression of p(At) for AA< << 1, which 

applies to the case with arbitrary Arj,. 

For Taylor expansion of the function F according to the power of At, we obtain 



where 



Inp(At) = Inp(O) - (n + 7)AAi + -{(3 - Y)^^t + O(At^), 



3-i+2-in+|^ 



(5.21) 



and 



/3 = (2-2 +n)- 



2-i +n 



i + (I + 2 • 3"5)n + 2"5n^ + 



o 4 -3-5+2-2/1 

-I 

Ar^ 



(5.22) 



If the histogram were a Poissonian, a single exponential would be expected, which corresponds to /3 = 7 = 0. The 
parameters /3 and 7 represent the deviation from a Poissonian which do not vanish even in the frozen limit, i. e. 
Tj_ 00. At high density, on the other hand, n » 1, /? = 0{n) and 7 = 0{1), eq.(5.21) can be written as 



lnp(Ai)=lnp(0)-2 + O( — ), 

n 



(5.23) 



with z = nAAtwhich agrees with that of a single Poissonian. 



V. EXPERIMENTAL MATERIALS AND METHODS 

To create well-defined single or double fluor elements, short pieces of single stranded DNA (ssDNA) were used as 
substrates. Either one or two fluorophores can be site-specifically coupled to each DNA oligomer, (29 bases in length) 



dependent on the number of end-strand modifications (primary amino linker arms, Midland Certified Reagent Co.). 
Succidinmyl ester Rhodamine 6G (Molecular Probes) was coupled to the modified sites in the presence of DMF, and 
purified by gel filtration and reverse phases chromatography (HPLC). 

The experimental setup (Fig. 2) is an inverted confocal microscopy arrangement. The sample is illuminated by the 
514. 5nm line of a Ar+ laser (Lexel 85) focused through a 60x water immersion objective (numerical aperature 1.2, 
Olympus). Incident power was empirically optimized at 100/xW, so that the photon counts per aggregate were at 
least 50,000 cps, while avoiding significant population of the triplet state or bleaching. Emitted photons are collected 
through the same objective, directed through a high-pass dichroic mirror (Omega Optical) and a notch filter (Kaiser 
Optical) to reduce collection of on-axis elastically scattered photons. The collection volume is further refined by 
focusing the light onto a 25/im pinhole, eliminating off angle scattering as well as spatially defining the collection 
volume. The collection volume was empirically determined though the number of molecules measured through FCS 
as a function of increasing concentration. All FCS measurements were 10 minutes in duration using a ALV 5000 E 
board for data collection and in-house data analysis software for fitting. The overall detection efficiency of the setup 
is estimated to be 3 percent. Photons are detected by a counting avalanche photodiode (EG&G/Perkin Elmer), 
pulsewidth 25ns whose signal is processed by task specific counting board (NI-TIO 6602 National Instruments), 
controlled by Lab VIEW. The period between each photon is measured by counting the number of external clock 
(4MHz) pulses (source) between each photon pulse (gate). 2^^ photons are collected for each trial. Data acquisition 
technology limited the total collection time to l-2s, depending on the incident intensity. Afterpulsing artifacts from 
the photodiodes were measured ~ 100ns, with comparison to cross-correlation of the same signal in two perpendicular 
detectors. Digital filtering was used to subtract the afterpulsing noise from the final histogram statistics. DNA 
concentrations are ^nM in PBS buffer such that only one monomer molecule at any time is within the collection 
volume. 

Photon counting data is collected as fluorescent particles diffuse through the sample volume. When the particles 
are outside the volume they are dark and undetectable. As they enter the volume they are excited with a certain 
probability and emit a temporal pattern of photons that are detected by a counter. Given a long integration time, 
many particles diffuse through the volume producing temporal fluctuations in fluorescence. Whereas the autocorre- 
lation function reveals the timescale of these fluorescence fluctuations, the probability distribution characterizes their 
amplitude. 



VI. RESULTS 



Photon counting data was collected for two systems, and the time delay histogram constructed for each (Fig 3). 
Both samples consist of very dilute (nM) identical sequences of ssDNA in buffer, and observations were made at room 
temperature ( 25 °C). We consider these systems noninteracting. In the first sample, specifically single end-labeled 
ssDNA was diluted to an average concentration of 1 molecule per collection beam volume at any time. The number of 
molecules was calculated from calibration of the collection volume (see Methods) at incident power (0.35/LtTO'^). 

The second sample contained the same sequence ssDNA however tagged with two fluorophores per object (one at 
each end). This sample was then diluted so that the average intensity in time of the two samples were equal. If 
the dimer system was exactly twice the fluorescence of the monomer, then the dimer concentration should be 0.5 
that of the monomer for the same given intensity. However, due to local quenching effects of the fluorophore, the 
average concentration of double-dyed molecules in the beam at any time was calculated to be 0.7 to maintain the 
same intensity. The empirical volume of the beam was calibrated from a serial dilution of a standard dye solution, 
and the rate of emitted photons per fluor,A, measured. 1/X — l/60kHz. Concentrations of single/double fluorescent 
aggregates in the beam were 1 and 0.7 molecules respectively. 

Empirical measurements demonstrated the quenching could be minimized by hybridization of the ssDNA to a 
non-fluorescent target. Hybridization of such a short segment (below the persistent length of dsDNA) forces the two 
fluorophores farther apart, minimizing dye-dye interaction. However, double-dyed molecules never demonstrated a 
full factor of 2 increase in fluorescence over their single-dyed conterparts. We suspect local quenching interaction of 
the fluor with the nearby base of the target strand. 

The characteristic diffusion time through the sampling volume was extracted from the decay of the autocorrelation 
function. Due to the resolution of our correlator software (10~*s — 10~^s), we are most sensitive to the diffusion 
across the short axis of the beam profile. Hence, the characteristic time extracted from time correlation of the data 
is most representative of t±. Through FCS measurements, we find that t± ~ 300/is for both single-dyed molecules 
and double-dyed molecules, Using a random walk simulation and a geometical approximation for the long axis of the 
beam profile, we estimate ry to be 100t_l. 

In all theoretical curves of the figures below, we use A = 60kHz for specific brightness of the single-dyed molecules 
and an enhancement of 1/0.7 ~ 1.4A for the double-dyed molecules. The transverse diffusion time t± = 300/i s is 
substituted into the theoretical formula for for both single-dyed and double-dyed molecules. 



At first glance, the two distributions of Fig. 3 are completely indistinguishable. However upon closer examination 
it is possible to see the two curves slowly diverge at large At with the double dye data falls slightly above the single 
dye data. The comparison between the analytical expression of the time delay histograms with large and small At 
approximations (5.20) and (5.21) is shown in Fig. 4, where the ratio of the experimental histogram to the theoretical 
one is plotted versus the time delay At. It is evident that the formula (5.19) together with the approximation (5.17), 
denoted as Ptheory(At) in the Figures, is robust for single (A) and double dyed (C) samples. (B) and (D) are exploded 
views of the small At domain of each histogram. The curve labeled by "linear" or "quadratic" corresponds to the 
theoretical formula (5.21) with terms beyond linear or quadratic truncated. The quality of the agreement is improved 
from the linear truncation to the quadratic truncation. The curve labeled by "Poisson" corresponds to /3theor.(At) 
given by a Poissonian, i.e. In ptheor. {At) = In ptheor. (0) — XAt, which is clearly a poor description of the experimental 
histogram. 

Although these simulations successfully differentiate the two systems, the analysis is somewhat cumbersome. A 
common mathematical technique to highlight the subtle differences in distributions is moment analysis. Similar 
techniques have proven useful in fluctuation spectroscopy [14-16]. The first moment is the mean, the second moment 
the standard deviation, the third the skewness, etc. 

Returning to Fig. 3, we note a profound difference between the experimental data and a single Poissonian process. 
The straight lines represent two hypothetical single Poission systems with different timescales. For the simple detection 
of emitted photons within a fixed volume one might expect the statistics to resemble a single Poissonian process. [17] 
However, when the particles are allowed to diffuse through the boundaries of the volume, an additional Poissonian 
process contributes to the overall photon statistics. [18,19] Not only must we account for the stochastic nature of the 
emission process, we must also consider the number distribution of aggregates passing into the beam volume from the 
larger sample reservoir. The statistics of the time series of the photon counting can be highlighted through Mandel's 
Q parameter (Eqn 2.5) introduced in section II. 

We develop the binning moment as a complementary technique to FIMDA. In FIMDA [11] each histogram is 
representative of the number count distribution using a certain binning window size. For every change in binning 
window size, a new histogram is constructed. Likewise, every histogram has its own unique set of moments. All 
first moments should be equal to the product of the average intensity and the the window size and will not show 
any difference between the two systems (single dye and double dye) in accordance with the experimental procedure 
described in the first paragraph of this section. Starting with the second moment, the difference between the two 



system emerges. In Fig 5, we plot the second moment normalized according to (2.5) for both system and the 
corresponding theoretical curve given by (5.13). The data for the two systems are clearly distinguishable and agree 
well with the theoretical prediction. 

If the photon histogram were a single Poissonian, all correlations as well as the Q parameter would vanish. Therefore 
the correlation functions and the Q parameter measure the deviation of the photon histogram from a Poissonian. 
Recall that the time-delay histogram is essentially the result of a two Poissonian processes. If we re-examine Fig 
3 (although it is a time delay, not number counting histogram) and focus on the dashed line through the short At 
domain, divergence from the single Poisson increases as At increases. For binning windows smaller than the diffusion 
time, the statistics are primarily due to the intrinsic fluorescence of the fluorophore. Since both systems contain 2 
fluors, the two distributions vary little in this domain. At longer times, each molecule samples the inhomogeneous 
beam profile as it diffuses. Hence the diffusion dependent statistics dominate the long time domain and are responsible 
for the notably different Q parameters of the samples. 



VII. SECONDARY EFFECTS 

In traditional fluorescence correlation spectroscopy, triplet state effects and various quenching processes are the 
principal mechanisms that overshadow structural information. To parse out these contributions, one may need to 
explore the higher binning moments than have been examined in this work. Our mathematical formulation provides 
the complete systematics for this purpose. Such secondary effects introduce additional timescales to the problem 
(not simply the fluorescence rate and the diffusion time to cross the collection volume discussed in this paper). Also, 
some details of the electronic transition inside a fluorophore should be addressed. This modeling can be achieved by 
enlarging the quantum mechanical analog with a multi-component wave function and a matrix Hamiltonian 

hij = -D\7^6ij+Vij (7.1) 

where 

!z{t)Wij if i — > J is a fluorescence transition; 

Wij if 2 ^ j is not a fluorescence transition; (7.2) 

Each component of the wave function, ipi represents the probability of a DNA molecule at a particular spatial location 
with its fluorophore in the i-th electronic level and Wtj the transition rates from ith jth of electronic levels. 



In principle, such an elaboration should also be implemented in the absence of triplet state effects and quenching 
processes, since the fluorescence rate combines the excitation rate from the ground state and the spontaneous emission 
rate from the excitation levels. For two electronic levels with '0' labeling the ground state and '1' the excitation state, 
we have Woi = Xu{f) and Wio = As, Einstein's A-coefficient. The Schroedinger equation (4.18) is split into two 

components: 

= (W - Au)Vo + 2(i)AsV'i (7.3) 

^Vi = (^V^ + A,«)Vi - AuV'o (7.4) 

The approximation employed in previous sections amounts to A^ >> A and A >> 1/t±, the diffusion time. In 
this case, the spontaneous emission is almost instantaneous once the fluorophore is excited and the second equation, 
(7.4), gives rise to Xsi>i — Xuipo at equilibrium. Substituting it back (7.3), we obtain (4.18) with h given by (4.14). 
For the experimental data presented in this paper, a 60kIIz photon time delay histogram at 3% detection efficiency, 
the excitation rate is 60/3%=2MHz, and the corresponding fluorescence time is 500ns, much longer than the typical 
spontaneous emission time, 10ns. Our approximation is therefore adequate. 



VIII. CONCLUDING REMARKS 

We have developed a mathematical formulation to analyze the time delay series of fluorescence photons from 
diffusing particles, based on a probability generating functional and its quantum mechanical analog. Although it may 
appear formal, since some analytical expressions such as the auto-correlation function have been obtained by less 
sophisticated means, the approach is systematic. The potential of this general approach will be realized when dealing 
with systems of greater complexity, e.g. in the presence of the chemical reaction discussed below. 

We have designed an experiment to differentiate fluorescent aggregates in solution. ssDNA monomers are labeled 
with a single fluorophore and dimers with two fluorophores. Secondary effects such as chemical reactions, triplet state 
effects and various quenching processes have been neglected in our model. However these effects have been minimized 
by using dilute solutions to avoid self-interaction and Rhodamine 6G, a fluorophore with little triplet state at low 
incident intensities. 

Although we have only addressed ideal experimental conditions of the non-reacting case in this manuscript, typical 
biological/chemical systems react (aggregation or cooperative binding). Our technique can be modifled to include 



these reactions. One must generalize the quantum mechanical analog to the case with several species of particles, 
each representing a fluorescence molecule, interacting with each other. Without going to technical details which will 
be reported elsewhere, we quote the generalization of our formulation analog in the presence of a binary reaction, 

2A B. (8.1) 

The Hamiltonian of the quantum mechanical analog in (4.12) is given by 

H = HdiS. + -fffluor. + -f^chem., (8.2) 



where 



and 



^^diff. = j d^r{DAVi^ ■ VV' + DbV^ ■ V(t>), (8.3) 
^^fluor. = [1 - z{t)] J rfVu(r)[(A^^V + Ab#) (8.4) 

^^chem. = J d^'r J d\' j d^Ra{R\fy)^{R)i>{f)ij{f) 

~\ j J j '^^^'^'(^'^l'^)V'(^V'(r=')<^(^) 

+ i / d'V / j d^Ra{R\r,r')^{f')i){r)'ti^{r)i^{r') 



+ i j d^r j d^r' j Ra' {f,r'\R)(t>{R)(t>{R), (8.5) 

and the generating fimctional is no longer factorizablc. In this Halmitonian, the pairs of operators (ip^ip) or are 
the creation/annihilation operators of a molecule of species A or B. -Da(b) and \a(b) are the diffusion constant and 
the specific brightness of the species A(B). The first term of the reaction part, .ffchem represents the creation of a B- 
molecule, the second term represents the creation of a pair of A-molecules, the third term signifies the annihilation of 
a pair of A-molecules and the last term signifies the annihilation of a B-molecule. The function a{R\f, f*) or a'{-r,r'\R) 
denotes the reaction rate in each direction of (8.1). In physics, the Hamiltonian (8.2) describes a system of interacting 
bosons of two species. The equilibrium state of the fluorescence-diffusion-reaction process will be analogous to the 
ground state which carries a Bose condensate. The proportion of each species in the condensate is determined by 
the mass-action law and the fluctuations are calculatable with well developed field theoretic method. In terms of the 



generating functional (4.2) and the functional derivative (4.4), we are able to calculate various correlation functions 
of the photon counting histogram in the presence of chemical reactions using techniques developed in quantum field 
theory. This approach is expected to be more systematic than the conventional reaction kinetics. 
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Appendix A 

A.l The derivation of the formulae for distribution function of the time delay histogram. 

To derive the At distribution (the distribution function of the time interval between two successive photon emis- 
sions), we start with the case with finite time bins (e is sufiiciently small that the probability of more than one photons 
within a bin can be ignored.) and look for the probability of one photon event in n*'*-bin, one photon event in (n + /)*'* 
bin with At = {I — l)e and no photon in the bins between them. Up to a normalization constant, the probability is 

Pnl — const. ^ ] -Fmi...mn-ilO...,01m„_l_l_|.i...mjv' 

mi,...,mn-i)Tnn+i+i5---,^A/' 

d d 

= e(l,...,l,^n,0,...,0,z„+i, !,...,!). (Al) 

By summing over n with a fixed I, we find the probability of successive I — 1 empty bins 

Pi =^ Pnl- {A.2) 

n 

Taking the limit of infinitesimal bins, i.e., e ^ at fixed T and At, we obtain the desired distribution function (4.7). 
A. 2 Detector effect 

Consider the case of a single time bin, \.e.J\f = 1, we have 

C30 

m— 

and 

CXD 

m=0 



Under the assumption that each photon counting by the detector is statistically independent of others, the probability 
of detecting m photons out of n incident photons is 

n! 



m!(n — to)! 

with T] the detector efficiency. Therefore 



r/'"(l-r7)"-™ (A5) 



CSO , 

7,' 



miin. — mil 



n=m 



' m\{n — to)! 
Substituting (A. 6) into (A. 4), we obtain that 

m!(n — ml! m!(n — m ! 

m— n—m ^ ' n— m— ^ ' 

CO 

= E^n(i-?? + ry^)" = e(i-ry + 7y^). (A 7) 

Following the same steps for each variable in the case with > 1, we end up with (4.10) and (4.11). 

Appendix B 

The probability generating functional of the photon emission histogram from A'' identical fluorescence molecules is 
modeled according to the following two principles: 

1) . The generating functional with N nonreacting molecules 

QW)\=g''{z{t)\ {B.\) 

with g\z(fy\ the generating functional of one molecule. 

2) . The generating functional of one molecule, 

c 

where (7c[2:(i)] denote the generating functional along a particular diffusion path, C, and Pc the probability of the 
path. 

Dividing the integrating time T into J\f time bins with to = 0, tn = ne, tj^ = Me = T and specifying a diffusion 
path C by the location of the molecule at each instant, (to, ■■■,tn, T), i.e. 

C = {fo,...,f„,...,rV}. {B.3) 



For sufficiently small e, the probability of more than one photon in each bin may be ignored and we have 



gc[z{t)] = liin TT [1 - eXu{fn) + eA^„u(f„)] 

e— »0 



= lime'^'^l^o'^^"-^^"^'""^ (BA) 
The probability of the path C is entirely determined by diffusion. Since the probability for a molecule to diffuse from 

fn at tn to fn+1 at tn+1 is 

n=0 

It follows from (B.2), (B.4) and (B.6) that 

1 , 1 ^ ML f ■'^ „ 



with 

1 f fn+i-f„ 



where we have taken the average of the intial location of the molecule, tq over the volume of the solution. Mathemat- 
ically, eq.(B.7) and (B.8) present a path integral of a quantum mechanical particle moving in an external potential 
[1 — z{t)]Xu{f) in an imaginary time. A similar path integral has been used to describe fluorescence correlations. [20] 
Following the standard procedure [21], we may cast (B.7) into the canonical form 

g[z{t)] = ^<\Te-Io''"'^'^\> {B.9) 

where T is the time ordering operator, 

hit) = -DV^ + [1 - z{t)]Xu{r) {B.IO) 

is the analog of the quantum mechanical Hamiltonian operator and |0) is the analog of a quantum mechanical state, 
whose wave function is < f| >= 1. 

Finally, we would like to explain the operator T in more detail, when acting on a product of operator functions of 
time, it arranges the order of these operators according to the descending order of their time arguments, i.e. 

T0{ti)0{t2)...0{tn) = 0{tp,)0{tp,)...0{tpj {B.ll) 



with Pi,P2, ...,Pn a permutation of 1,2, ...,n, such that tp^ > tp^ > ... > tp^. This property, when appUed to the 
Taylor exponential operator in (B.9), yields: 

Te" Jo = 1 - y dth{t)+ I dt2 I dtih{t2)h{ti) + ... 



JO 



+(-) 



pT pin rt2 

" / dtn dtn-1... / dtih{tn 

Jo Jo Jo 



)h{tn-l)...h{ti) + ... 



{B.12) 



Appendix C 



For a time independent operator A and a time dependent operator B{t), the following identity holds with the time 
ordering product: 

Te- /o dr[A+Bir)] ^ ^-tA <ir6(r)l ^^ -^^ 

with B(t) = e^^B(r)e-^^. 

The expansion of the functional (4.21) that generates all correlation coefficients for a non-reacting system follows 
from the identity (C.l) with A = -DS/'^ and B = [1 - z{t)]Xu{f). We find 



J^[z{t)]=c\ [ dt{l- Z{t)) <\u\> - ( dt2{l - Z{t2)) I 2dti{l- Z{ti)) <\ 

'-Jo Jo Jo 



and 



I = Ci{t)=cX < \u\ >, 



C2(t,i') = cA'<l«el*-*'l^^'u|>. 



(C.2) 



(C.3) 



(C.4) 



This expansion is parallel to the perturbative expansion of the quantum mechanical analog. 

The expansion in the diffusion constant for a non-reacting system is obtained by applying the identity (C.l) with 
A = —Xu, B = —DV^ and t = At, and making a Taylor expansion of the second factor on the right hand side of 
(C.l). 



nF{T\C) = c 



< |1 - e-^^^"| > - [ ds< |e-C^(-^-«)»£)V^e-*f-^^"| > +. 

^0 



{C.5) 



This expansion corresponds to the strong coupling expansion of the quantum mechanical analog. 

The calculation of the expectation value < |...| > in (C.2) and (C.5) facilitated by switching between the coordinate 
and momentum representations of the quantum mechanical analg. The state | > is the state of zero momentum and 
is normalized the the volume of the system. 



IX. FIGURES &; CAPTIONS 



1. Schematic of a photon counting trace. Traditional photon counting divides the total integration time into bins, 
counting the number that fall into each. We propose a new type of counting based on the time delay between two 
successive photon counts. 

2. Schematic of the photon counting setup. OBJ=objective, DM=dichroic mirror, NF=Notch Filter, PH=Pinhole, 
APD=Avalanche Photodiode, CB=Counting Board, CO=Correlator Board 

3. The time-delay histograms for single (black) and double (grey) labeled ssDNA. The resulting distributions from a 
single Poisson processes are shown by the straight lines for comparison. 

4. (A)large At single fluor, (B)small At single fluor, (C)large At double fluor, (D)small At double fluor. (Solid) 

Poisson, (short dash) linear, and (long dash) quadratic fits are shown for comparison in the short time limits. 

5. Q Parameters of (•) single dye and (A) double dyed aggregates. A cartoon of each system is displayed next to the 
corresponding curve. 
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